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ABSTRACT 

In order to enhance DNA triple helix stability 
synthetic oligonucleotides have been developed 
that bear amino groups on the sugar or base. 
One of the most effective of these is bis-amino-U 
(B), which possesses 5-propargylamino and 
2 -aminoethoxy modifications. Inclusion of this 
modified nucleotide not only greatly enhances 
triplex stability, but also increases the affinity for 
related sequences. We have used a restriction 
enzyme protection, selection and amplification 
assay (REPSA) to isolate sequences that are bound 
by the heavily modified 9-mer triplex-forming oligo- 
nucleotide B 6 CBT. The isolated sequences contain 
A„ tracts (n = 6), suggesting that the 5 -end of 
this TFO was responsible for successful triplex 
formation. DNase I footprinting with these 
sequences confirmed triple helix formation at 
these secondary targets and demonstrated no 
interaction with similar oligonucleotides containing 
T or 5-propargylamino-dU. 

INTRODUCTION 

Triple-stranded nucleic acids are generated by the 
sequence-specific binding of oligonucleotides within the 
duplex major groove (1-3). The third strand bases form 
specific hydrogen bond contacts with the Hoogsteen face 
of the duplex purines and these structures are therefore 
usually restricted to oligopurine.oligopyrimidine tracts. 
The third strand can bind either parallel or antiparallel 
to the duplex purine strand, depending on its sequence. 
CT-containing oligonucleotides bind in a parallel arrange- 
ment, generating C + .GC and T.AT triplets (4,5), while 
GT- or GA-containing third strands bind antiparallel 
forming A.AT, G.GC and T.AT triplets (6). Although 
the parallel motif requires conditions of low pH, necessary 
for protonation of the third strand cytosine, this has been 
more widely studied than the antiparallel form as 



purine-rich oligonucleotides can form other competing 
structures. 

Although triplex formation occurs with high specificity, 
these complexes are usually less stable than their duplex 
counterparts, largely due to electrostatic repulsion 
between the three negatively charged strands. A number 
of strategies have been employed to enhance the binding, 
including attachment of other DNA binding groups, such 
as intercalators (7-10), addition of exogenous triplex- 
binding ligands (11-15) or inclusion of modified nucleo- 
tides that bear charged groups on either the sugar (16,17) 
or the base (18-22). The most successful of these nucleo- 
tide analogues is bis-amino-U (B), Figure 1A, which 
contains a propargylamino group attached to the 
5-position of the base and a 2'-aminoethoxy group 
attached to the sugar (23-27). This analogue combines 
the two modifications, which had previously been inde- 
pendently shown to enhance triplex formation. The 
propargylamino group is thought to interact with the 
adjacent 5'-phosphate on the third strand, while 
the aminoethoxy group interacts with a phosphate in 
the duplex purine strand (19). Inclusion of a single B 
residue within an 18-mer triplex-forming oligonucleotide 
(TFO) increased the melting temperature by 8°C at pH 6.0 
(25) and facilitated triplex formation at higher pH values 
(28). Studies on the specificity of TFOs containing one B 
residue showed that the B.GC triplet is weaker than B.AT 
(1 1°C lower melting temperature), but the interaction with 
TA and CG was no stronger than the mismatched triplets 
formed with T (25). B therefore has enhanced discrimin- 
ation against pyrimidines, but has the same selectivity 
between A and G as the natural base T. These previous 
studies on specificity used TFOs that only contain one B 
residue; we were concerned to examine the specificity of 
TFOs that contain multiple B-residues, since these may 
possess a net positive charge and might therefore show 
reduced selectivity, interacting with secondary sites. 

One method for identifying the preferred sites of 
DNA-binding ligands is Restriction Endonuclease 
Protection, Selection and Amplification (REPSA), which 
was first described by Hardenbol and van Dyke (29-33). 
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GAGAAAAAAGA ACTGGTTG CGTAA TTTT CATCCG TAACG. . .3' 



Figure 1. (A) Chemical structures of bis-amino-U and propargylamino-dU. (B) DNase I footprint showing the interaction of B-TFO with the 
/j7-T(43-59) fragment. The experiment was performed in 50 mM sodium acetate pH 5.0; TFO concentrations (uM) are shown at the top of each gel 
lane. The DNA was labelled at the 3'-end; the gel therefore runs 5'-3' from top to bottom. The lane labelled control shows digestion of the DNA in 
the absence of oligonucleotide; GA is a marker specific for purines. The exact target site is shown by the filled box and the accompanying 
enhancement is indicated by an asterisks. Secondary binding sites are shown by the unfilled boxes and the enhancements are indicated by bracketed 
asterisks. (C) Sequence of the oligopurine tract in />rT(43-59) and its interaction with the modified TFOs. The exact target site is boxed. 
(D) Sequence of a part of the /jtT(43-59) fragment showing the position of the secondary binding sites, which are underlined and the enhancements, 
which are shown in bold. The exact target site is boxed. 



This technique has previously been used to study the inter- 
action of proteins, small molecules and oligonucleotides 
with DNA. Since it does not require the physical separ- 
ation of the bound and unbound templates, it is ideal for 
studying non-covalent interactions. Like SELEX, REPSA 
uses a population of synthetic DNA templates, which 
contains a central region of randomized bases that are 
challenged by the ligand. A recognition site for a type 
IIS restriction enzyme, such as Fok I (GGATG), is 
included in the template adjacent to this random region. 
Since type IIS restriction enzymes cut at a fixed distance 
from their binding site they can be used to cleave a region 
with an unknown sequence. This makes REPSA superior 
to other similar methods as no prior knowledge of the 
target sequence is required. When these random duplex 
targets are incubated with the ligand, a small number 
will contain the correct binding sites and will be protected 
from enzyme cleavage. The majority of the templates will 
not be bound by the ligand and so will be cleaved. The 
reaction mixture is then subjected to polymerase chain 
reaction (PCR), in which only the bound (uncleaved) tem- 
plates will undergo exponential amplification. Each round 
of selection therefore enriches the oligonucleotide pool for 
templates that contain the ligand-binding site and after 
several rounds of selection the remaining templates are 
cloned and sequenced. 

This study examines the binding of a short 9-mer triplex 
forming oligonucleotide, which contains 7-bis-amino-U 
residues. We first examine its interaction with a 



fragment that contains its predicted binding site, 
and then use REPSA to isolate other sequences to 
which it binds. The interaction with these secondary 
binding sites is then confirmed by DNase I footprinting 
and compared with oligonucleotides that contain T or 
propargylamino-dU . 

MATERIALS AND METHODS 

Oligonucleotides 

All oligonucleotides were synthesized on an Applied 
Biosystems ABI 394 automated DNA/RNA synthesizer 
on the 0.2- or 1-umol scale using the standard cycles of 
acid-catalysed detritylation, coupling, capping and iodine 
oxidation procedures. Phosphoramidite monomers and 
reagents were purchased from Applied Biosystems or 
Link Technologies. The phosphoramidites for B and 
propargylamino-U (P) were prepared as previously 
described. (19, 27) The 9-mer triplex-forming oligonucleo- 
tide used in the REPSA experiments had the sequence 
5-BBBBBBCBT, while 5' -TTTTTTCTT and 
5'-PPPPPPCPT were also used in the footprinting experi- 
ments. The naphthylquinoline triplex-binding ligand was 
provided by Dr L. Strekowski, Dept. Chemistry, Georgia 
State University, Atlanta. 

REPSA. REPSA was performed as previously described 
(29). The selection template used for these experiments 
had the sequence 5'-GTAGGATCCTGACT GGATG 
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AAN j 5 TCTTCCTGACAAGCTTCAG-3' (the underlined 
bases show the Fokl recognition sequence). Double- 
stranded DNA was generated from this by annealing the 
primer 5' -CTGA AGCTTGTC AGG AAG A-3 ' , which had 
been labelled with 32 P at the 5' -end, and extending with 
Tag polymerase. The double-stranded template containing 
the 15-bp random sequence was then purified on a 6% 
non-denaturing polyacrylamide gel and resuspended in 
10 ill Fok I buffer (50 mM potassium acetate, 20 mM 
Tris-acetate pH 7.9, lOmM magnesium acetate, 1 mM 
dithiothreitol). Ten microlitres of 10 -uM oligonucleotide 
5'-BBBBBBCBT, dissolved in 50 mM sodium acetate pH 
5.0, was then added and the mixture was equilibrated for 
1 h at 20°C (the final pH was about 5.6). The complex was 
then cleaved with 8 U Fokl for 5 min at 37°C and the 
products of the reaction were subjected to PCR. 
Unbound sequences were cleaved by the enzyme and 
were not amplified by PCR while those protected by the 
triplex-forming oligonucleotide were selectively amplified. 
By including the Fokl recognition sequence (GGATG) in 
the primer sequence, we avoided the selection of any 
nuclease-resistant products that arise from mutations in 
the restriction enzyme recognition site. PCR amplification 
was performed by mixing lOul of the cleavage reaction 
with 15 |xl of the labelled forward primer 5'-CTGAAGC 
TTGTCAGGAAGA, 2 ul of unlabelled 5'-GTAGGATC 
CTGACTGGATG (80 uM), 1 ul 25 mM dNTPs, 5ul 
lOxTaq buffer and 5 U of Tag polymerase in a total 
volume of 50 ul. The mixture was subject to 15 cycles of 
93°C (30 s) and 50°C (3 min). A further two cycles of PCR 
were then performed after adding 1 ul dNTPs, 2.5 ul un- 
labelled forward primer and 5 U of Tag polymerase. This 
pulse-chase procedure, using a low concentration of the 
radiolabeled primer for the initial cycles, ensures efficient 
incorporation of the radiolabel, while preventing the for- 
mation of any mismatched duplexes, which would have 
arisen from the annealing of PCR products. The amplified 
DNA was run on an 8% non-denaturing polyacrylamide 
gel from which the full-length products were purified. The 
purified template, enriched in binding sites for the TFO, 
was redissolved in 10 ul of Fokl buffer and subjected to a 
further round of REPSA selection. The procedure was 
repeated 10 times. After the tenth round of selection, the 
products were cleaved with BamHI and Hindlll and 
ligated into pUC19 that had also been cleaved with 
BamHI and Hindlll. The ligated plasmid was transformed 
into Escherichia coli TG2 and plated on agar plates con- 
taining 0.02% X-gal, 1 mM IPTG and lOOug/ml carbeni- 
cillin. White colonies were picked from these plates and 
sequenced. Colonies were first sequenced manually using a 
T7 sequencing kit (USB), although the sequences of all the 
fragments used for the footprinting experiments were sub- 
sequently confirmed by Eurofins MWG Operon, 
Ebersberg, Germany. 

DNA fragments for footprinting. Plasmids containing the 
REPSA templates were cleaved with EcoRI and labelled 
at the 3'-end with a- 32 P[dATP] using reverse transcriptase. 
The fragment of interest was then released by cutting with 
Hindlll, purified on a 6% polyacrylamide gel and 
redissolved in 10 mM Tris-HCl pH 7.5, containing 



0.1 mM EDTA at a concentration of at least lOc.p.s per 
microlitre (as determined using a hand-held Geiger 
counter). The tyrT(43-59) restriction fragment, which 
contains an 18-bp oligopurine tract that includes the 
target site for these 9-mer TFOs, was prepared as previ- 
ously described (34). 

DNase I footprinting. DNase I footprinting was per- 
formed as previously described (35). Briefly, 1.5 ul of 
radiolabeled DNA was mixed with 3 ul of oligonucleotide 
dissolved in 50 mM sodium acetate pH 5.0 to give a final 
concentration of between 0.03 and 1 uM and incubated 
overnight at 20°C. The mixture was cleaved by adding 
2 ill DNase 1 (typically 0.01 U/ml) dissolved in 20 mM 
NaCl, 2mM MgCl 2 and 2mM MnCl 2 , stopping the 
reaction after 2 min with 4.5 ul formamide containing 
lOmM EDTA and bromphenol blue. The products of 
the reaction were separated on a 9% denaturing polyacryl- 
amide gel containing 8M urea. After electrophoresis the 
gels were fixed, dried and subjected to autoradiography 
using a phosphorimager screen (Kodak), which was 
scanned with a Storm phosphorimager. 

RESULTS 

The 1 8-bp oligopurine tract in fyrT(43-59) has often been 
used to examine the binding of modified triplex-forming 
oligonucleotides (19,24,25,36,37). Single base mutations 
within this target have been used to examine the binding 
selectivity of oligonucleotides that contain one or more 
bis-amino-U (B) residues (25). These studies have shown 
that B has a very high affinity for AT relative to all other 
Watson-Crick base pairs in DNA and that each B.AT 
triplet increases the melting temperature by about 8°C 
relative to T.AT. Binding to GC is much weaker and 
B-modified oligonucleotides show enhanced discrimin- 
ation against YR compared with T. However, we were 
concerned that oligonucleotides that contain multiple B 
substitutions, generating TFOs with a net positive 
charge, might show reduced selectivity and interact with 
other secondary sites. We therefore examined the inter- 
action of the 9-mer TFO 5'-BBBBBBCBT, which is 
expected to have a formal charge of +5 at physiological 
pH, with its target site in ?j';T(43-59) (Figure 1C) by 
DNase I footprinting and the results are shown in 
Figure IB. As expected, a clear footprint is produced at 
the intended target (indicated by the filled box), which 
persists to nanomolar concentrations. As previously 
noted, this footprint is accompanied by enhanced DNase 
I cleavage at the 3'-(lower) end of the target site 
(19,24,36,38). In addition to this footprint, other regions 
of protection are evident at the highest TFO concentration 
(1 uM) as indicated by the open boxes, and the primary 
footprint is much larger, extending throughout the entire 
oligopurine tract. Enhanced cleavage is also observed at 
the 3'-ends of each of these secondary sites. The location 
of these secondary sites is shown in Figure ID. No sec- 
ondary binding was observed with the unmodified TFO 
5'-TTTTTTCTT or the propargylamino-U-modified 
5'-PPPPPPCPT (data not shown). Visual inspection of 
these secondary sites does not reveal any obvious 
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common sequence characteristics. REPSA experiments 
were therefore carried out to select the preferred binding 
sites for this oligonucleotide from a pool of random 
templates. 

Since the conditions of the REPSA experiments are 
limited by the pH requirement of the selecting type IIS 
restriction enzyme (Fokl) we first examined the pH de- 
pendence of these secondary sites (data not shown). 
Although the primary footprint was still evident at 
pH 7.0, the secondary sites had largely disappeared at 
pH 6.0 and above (Supplementary Figure SI). These sec- 
ondary sites also disappeared within 1 min when the pH 
was rapidly raised from pH from 5.0 to 6.0 or 7.0. A low 
pH was therefore employed in these experiments in order 
to observe the secondary sites, although we observed that 
Fokl cleavage was at least 100-fold slower at pH 5.0 and 
6.5 than at pH 7.9. 

REPSA 

REPSA selection was performed with Fokl using the 
random template in the presence of 5 uM B-substituted 
TFO. This TFO concentration is higher than might 
normally be employed in gene inhibition studies, but was 
used in order to exaggerate the proportion of secondary 
binding sites. Out of 77 colonies picked for sequencing, 
45 contained an insert that could not be fully sequenced 
(possibly because they adopt unusual structures). The 
other 32 colonies were sequenced and the sequences of 
the N 15 random regions are shown in Supplementary 
Table SI. As the aim of this work was to compare the 
secondary binding sites with the exact target sequence, 
we only examined templates that showed some similarity 
to the exact target. Fourteen of these sequences, each of 
which contained an oligo(dA) tract, were therefore chosen 
for footprinting (Table 1). Sequence 6 is included as it was 



Table 1. Sequences of the central regions of 
clones obtained from REPSA selection, corres- 
ponding to the N 15 region in the random 



template 


Number 


Sequence 


1 


CAATCAAAAAAACAA 


2 


AAAAAAAAGTTGCTCT 


3 


CCCCGAACAAAAAAA 


4 


CAAAAAAAGAGAGAC 


5 


TATTGCTAAAAAAGG 


6 


AAATAAACGGCCATTG 


7 


CCAGAGCACAAAAAACAA 


8 


GGTAAAAAACCGGCA 


9 


CAACTCAAAAAATCA 


10 


C C C TAAARATAAC AA 


11 


GCAGTAAAAAAACTA 


12 


AAGAAAAGCAATGGGATCT 


13 


CACACGCAAAAAAG 


14 


AAGAAAAAACAGCTCCTAC 



For sequences 2, 4, 12 and 14 the first two As 
correspond to the bases in the flanking sequence. 
The strand shown is the one that is labelled on the 
footprinting gels and is the complementary strand 
to the original template strand. The oligo(dA) 
tracts are given in bold. 



originally thought to have a run of six As but was 
re-sequenced when the B-TFO failed to produce a foot- 
print and was found to contain a pyrimidine interruption. 
We considered that the other sequences were less likely to 
bind the TFO and are likely to be random sequences, 
which were not screened out by REPSA or which were 
refractory to Fokl cleavage for other reasons. Such se- 
quences might have been eliminated with further rounds 
of REPSA, but we were concerned to avoid introducing 
mutations in the template sequences. Indeed, it can be seen 
that some of the sequences are not the same length as the 
original random template; this is an acknowledged limita- 
tion of the REPSA technique. The relatively high level of 
unselected sequences may also arise because Fokl is not 
producing efficient double-strand cleavage under these 
conditions, allowing the persistence of some unbound 
templates. 

An interesting pattern is evident in these oligo(dA)-con- 
taining sequences; the selected templates were all in the 
same orientation, with the purine-rich strand opposite 
to the GGATG of the Fokl site (i.e. on the same strand 
as CATCC). As a consequence, all the footprinting experi- 
ments described below display the purine-rich strand of 
the target. None of the isolated fragments were in the 
opposite orientation with oligo(T) on the GGATG 
-containing strand. This is presumably a property of the 
selecting enzyme Fokl and will be considered further in 
the 'Discussion' section. 

DNase I footprinting 

Radiolabeled DNA fragments containing 14 of the 
REPSA-selected sequences were prepared and subjected 
to DNase I footprinting in the presence of varying con- 
centrations of 5'-BBBBBBCBT (B-TFO), 5'-PPPPPPCPT 
(P-TFO) or 5'-TTTTTTCTT (T-TFO). In each case, the 
radiolabeled DNA fragment and the TFO were incubated 
overnight at 20°C in 50 mM sodium acetate pH 5.0. The 
results for the B-substituted TFO are shown in Figures 2 
and 3. Since no footprints were observed with the 
P-substituted TFO at concentrations up to 30 uM, these 
experiments were repeated in the presence of 10 uM 
naphthylquinoline triplex-binding ligand and these 
results are presented in Figure 4. No footprints were 
evident with the unmodified T-TFO on any of these se- 
quences at concentrations up to 30 uM, even in the 
presence of the triplex-binding ligand (data not shown). 
The results for the various targets will be described in 
turn, while the positions of the various footprints are 
summarized in Table 2. The addition of 5mM MgCl 2 gen- 
erally made no difference to the position of the footprints, 
merely enhancing the strength of interaction with the 
P-TFO. In a few instances, this produced slightly different 
cleavage patterns with P-TFO in the presence of the 
triplex ligand and these are presented in Supplementary 
Figure S2. 

Sequence 1. This sequence contains the closest match to 
the perfect target site with a single G-to-C mutation, and 
contains a run of seven contiguous As followed by CAA. 
DNase I cleavage patterns show that the B-TFO produces 
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Figure 2. DNase I footprints showing the interaction of B-TFO with REPSA-selected sequences 1-7. The experiments were performed in 50mM 
sodium acetate pH 5.0; TFO concentrations (uM) are shown at the top of each gel lane. The DNA was labelled at the 3'-end; the gel therefore runs 
5—3' from top to bottom. The lanes labelled 'control' show digestion of the DNA in the absence of oligonucleotide; tracks labelled 'GA' are markers 
specific for purines. The locations of the footprints are indicated by the filled boxes and any accompanying enhancements are indicated by asterisks. 



8 



10 



11 



12 



13 



14 




Figure 3. DNase I footprints showing the interaction of B-TFO with REPSA-selected sequences 8-14. The experiments were performed in 50 mM 
sodium acetate pH 5.0; TFO concentrations (uM) are shown at the top of each gel lane. The DNA was labelled at the 3'-end; the gel therefore runs 
5—3' from top to bottom. The lanes labelled 'control' show digestion of the DNA in the absence of oligonucleotide; tracks labelled 'GA' are markers 
specific for purines. The locations of the footprints are indicated by the filled boxes and any accompanying enhancements are indicated by asterisks. 



a footprint down to a concentration of around 0.3 uM, 
even though this fragment does not contain its correct 
binding site, confirming that REPSA had successfully 
selected a TFO-binding site. Unusually, compared to 
many of the sequences described below, no enhanced 
cleavage is evident at the 3'-(lower) triplex-duplex 



junction. This fragment contains a run of seven consecu- 
tive As and the lower end of this footprint is coincident 
with the 3'-end of this tract. The upper end of the foot- 
print extends beyond the A-tract by 2-3 bases, as shown in 
Table 2. Triple helix protection from DNase I cleavage 
does not arise from direct steric blockage, as the enzyme 
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Figure 4. DNase I footprints showing the interaction of P-TFO with REPSA-selected sequences 1-14 in the presence of 10 uM naphthylquinoline 
triplex-binding ligand. The experiments were performed in 50 mM sodium acetate pH 5.0; P-TFO concentrations (uM) are shown at the top of each 
gel lane. The lanes labelled 'control' show digestion of the DNA in the absence of oligonucleotide; tracks labelled 'GA' are markers specific for 
purines. The locations of the footprints are indicated by the filled boxes and any accompanying enhancements are indicated by asterisks. 



and third strand are located in different grooves. A typical 
triplex footprint extends for 3-4 bases above (5'-) the 
target site, while the lower (3') end of the footprint 
matches the end of the triplex, and is often 
accompanied by enhanced cleavage of the terminal 
base at the triplex-duplex junction (19,24,36,38). Based 
on these observations, it seems reasonable to suggest 
that the B-TFO is bound to the A 7 tract and this will 
be considered further in the 'Discussion' section. In 
contrast, no footprints were observed with the T- or 



P-TFOs at concentrations up to 30 uM, although 
addition of 10 uM of the naphthylquinoline 
triplex-binding ligand induces a clear footprint in the 
same region with the P-TFO, which persists to a con- 
centration of about 3 uM (Figure 4). This interaction is 
further enhanced by the addition of magnesium chloride 
(5mM); in these conditions, the footprint persists to 
about 0.3 uM and is accompanied by enhanced 
cleavage at the terminal A in the sequence CAA 
(Supplementary Figure S2). 
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Table 2. Locations of the footprints and enhancements observed with 
the REPSA footprinting fragments 



Number Sequence 



1 AGGAAGACAAT CAAAAAA ACAA TTCATCC 

2 AGGAA GAAAAAAA AGTTGCTCT TTCATCC 

3 AGGAAGAC C C C G AAC AAAAAA ATTCATCC 

4 AGGAAGA C AAAAAAA GAGAQAC TTCATCC 

5 AGGAAGATATTG CTAAAAAA GG TTCATCC 

I AGGAAGAC C AGA GC AC AAAAAAC AACCCTCA TCC 

8 AGGAAGA GGTAAAAAA CCGGCAttcatcc 

9 AGGAAGACAACTCA AAAAA TCATTCATCC 

10 AGGAAGACCCTAAAAATAACAA TTCATCC 

I I AGGAAGAGCA GTAAAAAAA CTATTCATCC 

12 A GGAAGAAAA GC AATGGGATCT TTCATCC 

13 AGGAAGAC AC ACGCAAAAAAG TTCATCC 

14 AGGAAGAAAAAACAGCTCCTAC TTCATCC 



The selected sequences are shown in large type, while the adjacent 
flanking regions in the template are shown in smaller type and italics. 
Bases shown in bold correspond to positions of enhanced cleavage, 
while the footprints are underlined. 



Sequence 2. This sequence contains nine contiguous As 
followed by a G. The B-substituted TFO produces a foot- 
print at the highest concentration (1 uM), which is 
accompanied by two enhancements at the 3' -end of the 
oligopurine tract. A similar footprint can be seen with 
the P-TFO in the presence of the triplex ligand (at a 
TFO concentration of 30 uM), although the two enhance- 
ments are shifted one base further down compared with 
the B-TFO. The presence of two enhanced bands may 
indicate that the TFO can be bound in more than one 
location, each of which produces enhanced cleavage at 
its 3'-end. 

Sequence 3. This sequence contains seven contiguous As 
followed by three pyrimidines (TTC). The B-TFO 
produces a footprint at the highest concentration (1 uM), 
although one band is still evident within this region 
(Figure 2). This protection is accompanied by three 
enhanced bands at the lower (3') end of this region, 
which may again suggest multiple binding locations on 
this target. The P-TFO does not produce a footprint 
with this sequence, although addition of the triplex 
ligand produces a footprint in a similar location at the 
highest concentration (30 uM). Three enhanced cleavage 
products are again evident at the 3'-end of this footprint, 
although their relative intensities are different to those 
seen with the B-TFO alone. 

Sequence 4. This sequence contains seven contiguous As 
followed by three purines (GAG). The B-TFO produces a 
clear footprint on this template at a concentration of 1 uM 
(Figure 2), which is accompanied by a single enhancement 
at the guanine at the 3'-end of the A-tract. An unusual 
feature is that three bands below this enhancement also 
show attenuated cleavage. The B- and T-TFOs did not 
produce footprints on this sequence, although the addition 
of the triplex ligand generated a footprint with 10 uM 
P-TFO. This ligand-induced footprint is accompanied 



by several enhanced bands and is one base shorter at the 
3'-end than with the B-TFO. 

Sequence 5. This sequence contains six contiguous As 
followed by GGT. The B-TFO produces a clear footprint 
on this template at a concentration of 1 uM (Figure 2), 
which is accompanied by two enhanced cleavage products 
at the 3'-(lower) end of the footprint corresponding to the 
terminal A and the following G. The P-TFO does not 
produce a footprint on this template, even at a concentra- 
tion of 30 uM, although the addition of 10 uM triplex 
ligand produces a clear footprint, which persists to 
about 3 uM, and is very similar to that produced by the 
B-TFO alone. Further addition of 5 mM MgCl 2 causes the 
footprint to extend one base further in the 3'-(lower) dir- 
ection, with enhancement in the two guanines 
(Supplementary Figure S2). 

Sequence 6. This sequence contains an interrupted A-tract 
(AAATAAAC). None of the TFOs altered the DNase I 
cleavage pattern of this template under any conditions 
(Figures 2 and 4). 

Sequence 7. This sequence is similar to sequence 1 , but the 
A-tract is one base shorter (A 6 CAA). Once again the 
B-TFO produces a footprint within the REPSA-selected 
region, which is evident at the highest TFO concentration 
(1 uM). This is accompanied by a faint enhancement at the 
terminal A. No footprints were produced with the T- and 
P-TFOs alone, although the addition of 10 uM triplex 
ligand produced a clear footprint with 10 uM P-TFO, 
with no evident enhanced cleavage (as noted with 
sequence 1). 

Sequence 8. This sequence contains six contiguous As 
followed by CCG. The B-TFO produces a DNase I foot- 
print at the highest concentration (1 uM) (Figure 3) which 
is accompanied by enhanced cleavage at the terminal A of 
the A 6 -tract. The T- and P-TFO did not produce foot- 
prints, although in the presence of 10 uM triplex ligand 
30 uM P-TFO generated a similar footprint to that 
produced by B-TFO alone. 

Sequence 9. This sequence contains six contiguous 
As followed by TCA. The B-TFO produces attenuated 
DNase I cleavage within this A-tract, which is 
accompanied by four enhancements at the 3'-end 
(Figure 3). The T- and P-TFOs do not affect the 
cleavage pattern at concentrations as high as 30 uM, but 
a clear footprint is produced with 30 uM P-TFO in the 
presence of lOuM triplex ligand, which is also 
accompanied by several enhancements. 

Sequence 10. This sequence contains only five contiguous 
As followed by TAA. The B-TFO again produces a foot- 
print with this target at a concentration of 1 uM, which is 
accompanied by three bands of enhanced cleavage at the 
3'-end. In this case, no footprints are produced by the T- 
or P-TFOs even in the presence of 10 uM triplex ligand, 
although further addition of 5 mM MgCl 2 to the P-TFO 
plus triplex ligand produces a footprint that is similar to 
that seen with B-TFO alone (Supplementary Figure S2). 
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Sequence 11. This sequence contains seven contiguous As 
followed by CTA. The B-substituted TFO produces a 
footprint at a concentration of 0.3 uM with two 
enhanced bands at the 3'-end of the A-tract. No footprints 
were produced with T- and P-TFO alone. The addition of 
10 uM triplex ligand produces a footprint with P-TFO at a 
concentration of 3 uM in the same position as seen with 
B-TFO alone. This is accompanied by enhanced cleavage 
though the precise pattern of these enhancements is not 
the same as that with B-TFO. 

Sequence 12. In this sequence the oligo-A tract is inter- 
rupted by a single guanine and so the longest A-tract 
contains only four contiguous As. The B-TFO attenuates 
DNase I cleavage within this purine tract at a concentra- 
tion of 1 uM and produces a single enhanced cleavage 
product at a G at its 3'-end. The P- and T-TFOs did not 
affect the DNase I cleavage pattern, even in the presence 
of the triplex ligand. 

Sequence 13. This sequence contains six contiguous As 
followed by GTT. The B-TFO produces a footprint on 
this template at a concentration of 1 uM, which is 
accompanied by two bands of enhanced cleavage that 
are coincident with the terminal AG. In the presence of 
10 uM triplex ligand the P-TFO produces a similar foot- 
print that persists to a concentration of 3 uM. The same 
two enhancements as seen with the B-TFO are also 
evident. 

Sequence 14. This sequence contains six contiguous As 
followed by CAG. The B-TFO produces a clear footprint 
within this oligopurine tract, although no enhanced 
cleavage products are evident. A similar pattern is 
evident with P-TFO in the presence of 10 uM triplex 
ligand. 

DISCUSSION 

The results presented in this paper clearly show that this 
heavily modified B-substituted TFO can bind to second- 
ary sites. The interaction is not non-selective, but high 
affinity binding is observed at DNA sites that contain 
triplex mismatches and several of the REPSA-selected 
targets bind the B-TFO at sub-micromolar concentra- 
tions, even though they do not contain the exact target 
site. This is particularly striking in view of the short length 
of the TFO, which is only a 9-mer. This interaction 
appears to be restricted to the bis-amino-U modification 
since T- or P-containing TFOs showed no interaction with 
these target sites at concentrations up to 30 uM (although 
some sequences showed interaction with P-TFO in the 
presence of the triplex-binding ligand). 

The exact location of the TFO cannot be accurately 
deduced from these footprints as DNase I cleavage is gen- 
erally poor within these oligopurine tracts. However, 
most, though not all, of the selected sequences contain 
an A-tract that is 6 bp long. We assume that this corres- 
ponds to interaction with the six consecutive B-residues in 
the TFO. Sequence 6, for which the longest A-tract is only 
3 bp, shows no evidence of triplex formation. The two A 3 



tracts in this sequence, which are separated by a single T, 
are clearly not sufficient to generate a secondary binding 
site. Sequence 12 only contains an A 4 tract, although this 
is preceded by AAG and one of the bis-amino-U residues 
in the B 6 -tract must be interacting with this GC base pair. 
This seems reasonable since our previous work has shown 
that B can bind to an isolated GC interruption (about 
15-fold weaker than to AT, but much stronger than to 
TA or CG) (25). The position of the enhancement with 
this sequence suggests that the 3'-G is the terminal base of 
this triplex, forming a standard C + .GC triplet. Sequence 
10 also contains a shorter A-tract (A 5 ), although this is 
followed by TAA. The enhancements seen with this foot- 
print suggest that the TFO can bind in more than one 
location, although the strongest enhancement is at the 
first A following the TA interruption. In contrast, 
sequence 9 also possesses an A 6 tract, yet produced one 
of the weakest footprints. 

These results suggest that the binding of this TFO is 
dominated by the run of six consecutive bis-amino-U 
residues. The last three bases at the 3'-end of the TFO 
(CBT) seem to be less important for sequence recognition. 
This is emphasized by the observation that several of the 
selected A-tracts are flanked by a 3'-G, at which there is 
enhanced DNase I cleavage (sequences 2, 4, 5, 12 and 13), 
suggesting that even the C is not involved in the secondary 
triplex binding site. The lower stringency at the 3'-end of 
the TFO contrasts with earlier work that employed 
REPS A to examine the selectivity of a 1 9-mer 
GT-oligonucleotide, for which the 13 bases at the 3'-end 
of the TFO were shown to be sufficient for binding (30). 
However, this previous study concerned antiparallel 
triplexes, and so also involved the 5'-end of the target 
site. It also employed natural nucleotides with a lower 
affinity, whereas in the present study the 5'-end of the 
TFO is more heavily modified than the 3'-end to achieve 
high-affinity interactions. 

Previous work with bis-amino-U studied its interaction 
with single base mismatches and showed that, as well as 
forming strong complexes with AT, it also formed a 
weaker triplet with GC, but showed enhanced discrimin- 
ation against CG and TA, compared with the unmodi- 
fied nucleotide T (25). It is therefore not surprising 
that these secondary sites are all based on A-tracts; 
other mismatched triplet combinations do not appear 
to benefit from addition of the 2'-aminoethoxy or 
5-propargylamino-dU substitutions. The specific benefits 
of these additions are also evident from the observation 
that B-containing TFOs do not stabilize the triplexes 
formed with parallel or antiparallel GT-containing oligo- 
nucleotides. It should be noted that non-specific binding 
of triplex-forming oligonucleotides can also occur by 
looping out regions of the TFO (39). This is unlikely 
with this short 9-mer oligonucleotide, but we cannot 
exclude the possibility that one or more of the third 
strand nucleotides is looped out from the duplex. 

This study successfully isolated a number of clones, 
which clearly represent secondary binding sites for this 
TFO. However, none of these sequences represents the 
exact target site (A 6 GAA), although sequence 1 is only 
one base different (changing a G to C). At first sight this 
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is surprising, although it is likely to be the result of the 
TFO concentration (5 uM) used in these experiments, 
which is much higher than the affinity for the exact 
target site. These results further demonstrate the very 
high affinity of B for A-rich duplex DNA. Low concen- 
trations of B-containing TFOs will therefore need to be 
used in potential in vivo applications, reducing the inter- 
action with any secondary sites. 

Enhanced DNase I cleavage is often observed at the 
triplex-duplex boundary (19,24,38). This is only seen on 
the duplex purine strand (never on the pyrimidine strand), 
but is not observed with all sequences. For example, se- 
quences 1 and 14 show no evidence of enhanced DNase I 
cleavage. Although both these sequences contain an 
adjacent ApC, which is not present in any of the other 
sequences, this alone cannot be sufficient to explain the 
lack of enhancement as the original target site in 
0'f'(43-59) is also flanked by ApC. Although we are not 
sure of the origin of this difference, which may result from 
the A-B transition in the underlying duplex at the triplex- 
duplex junction, it has proved very useful in these experi- 
ments for delineating the 3'-end of the target site. The 
presence of multiple enhancements has not been 
observed in previous triplex footprints and we interpret 
this as indicating that the TFO can bind in different pos- 
itions within these targets. 

It is also interesting to note that all the cloned sequences 
are in the same orientation, in which the purine strand of 
the duplex is evident on the footprinting gels (the 
T-containing strand is on the same strand as the Fokl 
site GGATG). There is no reason in principle why the 
strands could not be reversed, producing fragments with 
a T„ tract on the labelled strand. This must be a function 
of the way in which Fokl cleaves the target triplex. This 
enzyme is known to interact with two copies of its target 
site before it can cleave the DNA; DNA cleavage only 
occurs after two DNA-bound enzymes interact with 
each other (40). Since our fragments only contain one 
Fokl binding site, this interaction must occur between 
two separate DNA fragments, either or both of which 
might be bound by the TFO. The effect cannot be ex- 
plained by suggesting that the enzyme only cleaves one 
strand of the underlying duplex, as the remaining strand 
would still be amplified by the PCR. It therefore appears 
that under these conditions Fokl may still be able to 
cleave both duplex strands of the triplex formed at GG 
ATG . . . A,„ while a similar triplex generated at GGATG 
. . . T„ is refractory to the enzyme. It is also possible that 
this strand selectivity might be caused by the effect of low 
pH on the Fokl enzyme, if the protonation state of the 
active site of a Fokl monomer on one strand is different to 
the protonation state on other monomer. 

It should be noted that in all these experiments the 
triplex was formed at pH 5.0, before digesting with Fokl 
at a pH of about 5.6. These non-physiological conditions 
were chosen in order to enhance the affinity of this short 
(9-mer) oligonucleotide for its target. Stronger binding 
could have been achieved by using longer B-containing 
oligonucleotides, which would normally be employed 
in biological applications, but this would have 
increased the number of potential secondary binding 



sites. The affinity of this TFO for its exact target site is 
pH dependent, as a result of the single third strand 
cytosine, with dissociation constants of <100nM at pH 
5.0, about 0.3 uM at pH 6.0 and 1 uM at pH 7.0. The 
footprinting experiments show that the affinities for 
these secondary sites are typically 0.3-1 uM at pH 5.0. 
Binding to these sites decreases at higher pH values, 
although we assume that the relative discrimination 
between target and non-target sites is not pH sensitive. 
For in vivo applications of modified triplex-forming oligo- 
nucleotides, the concentrations employed may well be 
lower than the affinity for these secondary sites, and so 
may not be a problem for such short sequences. However, 
longer TFOs may show significant interaction with sec- 
ondary sites at these concentrations. Other applications 
of triplexes, for instance, in detection/diagnostics or 
nanostructure formation (41), may well involve conditions 
of low pH, as they are not limited by physiological con- 
straints. These applications with heavily modified TFOs 
might be affected by annealing to secondary binding sites. 
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